NSF PAR Search | NSF Public Access Repository

History-Guided Video Diffusion

Song, Kiwhan; Chen, Boyuan; Simchowitz, Max; Du, Yilun; Tedrake, Russ; Sitzmann, Vincent (July 2025, 2025 Forty-Second International Conference on Machine Learning)

Classifier-free guidance (CFG) is a key technique for improving conditional generation in diffusion models, enabling more accurate control while enhancing sample quality. It is natural to extend this technique to video diffusion, which generates video conditioned on a variable number of context frames, collectively referred to as history. However, we find two key challenges to guiding with variable-length history: architectures that only support fixed-size conditioning, and the empirical observation that CFG-style history dropout performs poorly. To address these challenges, we propose the Diffusion Forcing Transformer (DFoT), a video diffusion architecture and theoretically grounded training objective that jointly enable conditioning on a flexible number of history frames. We then introduce History Guidance, a family of guidance methods uniquely enabled by DFoT. We show that its simplest form, vanilla history guidance, already significantly improves video generation quality and temporal consistency. A more advanced method, history guidance across time and frequency, further enhances motion dynamics, enables compositional generalization to out-of-distribution history, and can stably roll out extremely long videos.
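The abstract does not spell out a guidance formula, so the following is only a minimal sketch of the standard classifier-free guidance combination, with history frames standing in as the conditioning signal. The function name `cfg_combine`, the example values, and the use of plain lists in place of model outputs are illustrative assumptions, not the paper's implementation.

```python
def cfg_combine(eps_cond, eps_uncond, weight):
    """Standard CFG rule: move from the unconditional noise prediction
    toward the conditional one, scaled by the guidance weight."""
    return [u + weight * (c - u) for c, u in zip(eps_cond, eps_uncond)]


# Hypothetical noise predictions for three values (not real model outputs).
eps_with_history = [0.5, -1.0, 2.0]      # prediction conditioned on history frames
eps_without_history = [0.0, -0.5, 1.0]   # prediction with the history dropped

guided = cfg_combine(eps_with_history, eps_without_history, weight=2.0)
```

With `weight=1.0` the rule returns the conditional prediction unchanged, and with `weight=0.0` it returns the unconditional one; larger weights extrapolate past the conditional prediction.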
Free, publicly-accessible full text available July 17, 2026
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Chen, Boyuan; Martí Monsó, Diego; Du, Yilun; Simchowitz, Max; Tedrake, Russ; Sitzmann, Vincent (September 2024, Neural Information Processing Systems 2024)

This paper presents Diffusion Forcing, a new training paradigm where a diffusion model is trained to denoise a set of tokens with independent per-token noise levels. We apply Diffusion Forcing to sequence generative modeling by training a causal next-token prediction model to generate one or several future tokens without fully diffusing past ones. Our approach is shown to combine the strengths of next-token prediction models, such as variable-length generation, with the strengths of full-sequence diffusion models, such as the ability to guide sampling to desirable trajectories. Our method offers a range of additional capabilities, such as (1) rolling out sequences of continuous tokens, such as video, with lengths past the training horizon, where baselines diverge, and (2) new sampling and guiding schemes that uniquely profit from Diffusion Forcing's variable-horizon and causal architecture, and which lead to marked performance gains in decision-making and planning tasks. In addition to its empirical success, our method is proven to optimize a variational lower bound on the likelihoods of all subsequences of tokens drawn from the true joint distribution.
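As a rough illustration of "independent per-token noise levels", the sketch below draws a separate noise level for every token in a sequence and applies the usual forward-diffusion mixing to each one. The scalar tokens, the cosine-style schedule, the uniform sampling of levels, and the name `noise_tokens` are all assumptions made for the example; the abstract does not specify these details.

```python
import math
import random


def noise_tokens(tokens, num_levels=1000, rng=random):
    """Noise each token with its own independently sampled noise level.

    Tokens are plain floats here for simplicity (an assumption); in practice
    they would be vectors or latent frames.
    """
    noisy, levels = [], []
    for x in tokens:
        # Independent level per token, from 0 (clean) to num_levels (almost pure noise).
        k = rng.randrange(num_levels + 1)
        # Assumed cosine-style schedule mapping the level to a signal fraction.
        alpha_bar = math.cos(0.5 * math.pi * k / num_levels) ** 2
        eps = rng.gauss(0.0, 1.0)
        noisy.append(math.sqrt(alpha_bar) * x + math.sqrt(1.0 - alpha_bar) * eps)
        levels.append(k)
    return noisy, levels


# Usage: a seeded generator makes the draw reproducible.
noisy, levels = noise_tokens([1.0, 2.0, 3.0, 4.0], rng=random.Random(0))
```

A denoiser trained on such inputs would receive `levels` alongside `noisy`, so that it knows how corrupted each token is; that training loop is omitted here.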
Full Text Available
Oracle-efficient smoothed online learning for piecewise continuous decision making

Block, Adam; Simchowitz, Max; Rakhlin, Alexander (July 2023, Conference on Learning Theory)

Full Text Available
Beyond No Regret: Instance-Dependent PAC Reinforcement Learning

Wagenmaker, Andrew; Simchowitz, Max; Jamieson, Kevin (January 2022, Proceedings of Machine Learning Research)

Full Text Available
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes

Wagenmaker, Andrew J; Chen, Yifang; Simchowitz, Max; Du, Simon; Jamieson, Kevin (January 2022, Proceedings of Machine Learning Research)

Full Text Available
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach

Wagenmaker, Andrew; Chen, Yifang; Simchowitz, Max; Du, Simon S; Jamieson, Kevin (January 2022, International Conference on Machine Learning)

Full Text Available
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach

Wagenmaker, Andrew J; Chen, Yifang; Simchowitz, Max; Du, Simon; Jamieson, Kevin (January 2022, Proceedings of Machine Learning Research)

Full Text Available
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes

Wagenmaker, Andrew; Chen, Yifang; Simchowitz, Max; Du, Simon S; Jamieson, Kevin (January 2022, International Conference on Machine Learning)

Full Text Available
Towards a Dimension-Free Understanding of Adaptive Linear Control

Perdomo, Juan C; Simchowitz, Max; Agarwal, Alekh; Bartlett, Peter (October 2021, Proceedings of Thirty Fourth Conference on Learning Theory)
Full Text Available
Towards a Dimension-Free Understanding of Adaptive Linear Control

Perdomo, Juan; Simchowitz, Max; Agarwal, Alekh; Bartlett, Peter L. (July 2021, Proceedings of the 34th Conference on Learning Theory (COLT2021))
Full Text Available
